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Abstract 

We study, in general terms, the process by which a pure state 
can "self-thermalize" and appear to be described by a microcanoni- 
cal density matrix. This requires a quantum mechanical version of the 
Gibbsian coarse graining that conceptually underlies classical statisti- 
cal mechanics. We introduce some extra degrees of freedom that are 
necessary for this. Interaction between these degrees and the system 
can be understood as a process of resonant absorption and emission 
of "soft quanta" . This intuitive picture allows one to state a crite- 
rion for when self thermalization occurs. This paradigm also provides 
a method for calculating the thermalization rate using the usual for- 
malism of atomic physics for calculating decay rates. We contrast our 
prescription for coarse graining, which is somewhat dynamical, with 
the earlier approaches that are intrinsically kincmatical. An impor- 
tant motivation for this study is the black hole information paradox. 



Introduction 



1.1 Motivation 



Understanding the principles of Statistical Mechanics from first principles 
still remains a challenge [p|-[]3q]] despite the monumental work by the pio- 
neers like Boltzman, Poincare etc. According to the fundamental principles 
of statistical mechanics a macro-system starting from an arbitrary initial 
state eventually tends to a state of so called thermodynamic equilibrium. 
Inherent in this assertion is a certain irreversibility and one of the main 
issues has been the reconciliation of this with the fact that basic dynam- 
ics is reversible. In the case of Quantum Statistical Mechanics there are 
additional complications as the thermodynamic equilibrium state, which is 
described by a mixed density matrix can never be obtained from a pure 
density matrix under quantum mechanical evolution which is described by 
unitary transformations. This last issue is also the crux of the so called 
Quantum Measurement Problem. Interest in these issues has recently been 
rekindled from a most unexpected direction i.e the problems of black hole 
entropy and of the so called black hole information. In this section we briefly 
describe each of these to bring into focus the issues we address in the rest 
of the paper 

1.2 Foundations of Quantum Statistical Mechanics 

The questions discussed above lead naturally to questions about the basic 
postulate of quantum statistical mechanics(QSM). The postulate is that of 
equal a priori probabilities for all microstates in a given macrostate of def- 
inite energy. Equivalently, the density matrix of an ensemble whose energy 
lies in the range Eq to Eq + AE is given by the identity matrix in the energy 
basis, suitably normalized as shown (N is the number of states in the given 
energy interval): 

P = jj I E <E<E + AE (1.2.1) 
and for E not in this interval, 

p = 

It is worth pointing out here that the standard derivation of the micro- 
canonical distribution can not be carried over automatically to QSM. In 
classical statistical mechanics one derives the microcanonical distribution, 
by maximizing the entropy: 

S = -k B f dX N p{X N )logp{X N ) (1.2.2) 



2 



subject to 

l = JdX N p{X N ) (1.2.3) 

In these eqns p(X N ) is the distribution function on the phase space. The 
result one obtains is 

p(X N ) = const E < H <E + AE (1.2.4) 

and zero otherwise. It is often stated that for quantum systems the proof 
is similar with the density operator replacing p, the quantum mechanical 
density matrix operator pj]: 

S = -k B trp\ogp =-k B J2PnlogP n (1.2.5) 

n 

where P n = {E, n\p\E, n)(the fact that p is a constant of motion has already 
been used). But for a pure state the eigenvalues of p are 0,1 (only one 
eigenvalue 1 and all others 0). Hence the entropy is exactly zero and no 
variational principle is applicable. 

It is sometimes argued that what one deals in practice is not p itself but 
the time-averaged p: 

1 r t+T , 

Pit) = = / dt p{t) (1.2.6) 
21 Jt-T 

It follows trivially that 

trp = \ (1.2.7) 
But trp 2 1 as can be easily shown using: p(t) = e lHt p(0) e~ lHt : 



p(t) = ^2 J dt dt P$ ) P<t ) (1.2.8) 

Though tr p 2 ^ 1 and therefore the time averaged p represents a mixed 
state, tr p 2 is calculable in terms of the initial density matrix: 

tr p{tf =£ \(n\pmm)\\ Si f^ Enm T) ) 2 (1-2.9) 

n,m v nm > 

In fact traces of all powers of p are calculable and found to be completely de- 
termined by the initial pure density matrix and the spectrum of the Hamilto- 
nian describing the system. The important point is that all these constraints 
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have to be used in the variational method and subsequently one does not 
get the microcanonical distribution. 

One can ask, what is the nature of a quantum mechanical system that has 
the property that, even though we start it off in a pure state, it evolves after 
some suitable time r to a state that for all practical purposes (which means 
for all observables of interest) gives the same expectation values as a system 
described by the microcanonical density matrix. To mimic the situation in 
classical statistical mechanics where we consider an initial configuration with 
energy lying in a narrow band, it is assumed that the pure state is a linear 
combination of energy eigenstates with energy between Eq and Eq + AE. 
We should also discuss what happens when the pure state itself happens 
to be an energy eigenstate. We will give an answer stated in terms of the 
properties of the exact eigenfunctions of the system. This will be the main 
result of this paper. It seems to us that there is an inbuilt dependence 
on the choice of "interesting" observables which define the coarse-grained 
microstates. 

1.3 Quantum Measurement Problem 

According to the von Neumann projection postulate, also known as the ' col- 
lapse of the wave-function 1 postulate, if an observable A is measured in a 
generic quantum state, the result will be any one of the eigenvalues of A and 
the state after the measurement collapses to the corresponding eigenstate. 
This implies a transition from the initial pure ensemble characterized by 

tr p = l,tr p 2 = 1 (1.3.10) 

before the measurement to a mixed ensemble with 

tr p=l,tr p 2 t^I (1.3.11) 

Let us illustrate this with a simple example. Consider an ensemble of states 
\ip > of a two- level system on which a measurement is done whose possible 
outcomes are s = ± with the corresponding eigenstates |+ >, |— >. As the 
latter span a basis for the two-level Hilbert space we could expand \ip > in 
this basis 

> = c+ |+ > +c_|-> (1.3.12) 

The initial density matrix 

p t = \i;><^\ (1.3.13) 
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is pure. After the measurement we have a mixture of two pure ensembles 
of states |+ >,|— > with weight factors |c+| 2 ,|c_| 2 resulting in the final 
density matrix 

p f = |c+| 2 |+ >< +| + | c_ | 2 |- >< -| (1.3.14) 

It is easy to check that 

tr p f = l,tr p) = (|c+| 2 ) 2 + (|c_| 2 ) 2 = 1 - 2|c+| 2 |c_| 2 < 1 (1.3.15) 

As Unitary transformations preserve all traces 

p = UpU^ -► tr p' n = tr p n (1.3.16) 

the measurement process is not describable by a unitary transformation and 
is thus irreversible. It is as if the measurement process has to be treated 
differently from ordinary dynamics. This is the crux of the so called Quan- 
tum Measurement Problem and one sees that once again the crucial issue 
has arisen of pure states evolving into mixed states, or more plausibly into 
pure states that for all practical purposes look like mixed states. 

1.4 Black hole evolution and AdS/CFT correspondence 

One of our strong motivation for this study comes from an issue related to 
the "information paradox" in black hole physics. The issue is the following: 
Consider the quantum mechanical description of black hole formation. Some 
matter/energy in some initial state described by a wave function evolves in 
time and at some point makes a transition to a black hole state due to the 
attractive gravitational interaction. As per the Hawking effect the black hole 
appears to be radiating like a blackbody at its Hawking temperature. The 
system does not look to an external observer to be described by a pure state 
wave function but rather by a mixed state density matrix characterstic of a 
thermal state. The usual argument is that this transition from a pure state 
to a mixed state is illusory because when we include the degrees of freedom 
inside the black hole one recovers a pure state description. Indeed string the- 
ory has given us a prescription in some situations for actually counting the 
number of microscopic states associated with a black hole which reproduces 
the Bekenstein entropy. Nevertheless, (and this is the crucial point), while 
ignoring the degrees of freedom in the interior of the black hole can make 
the pure state appear mixed to the external observer, this does not explain 
why it should look thermal. In other words, when calculating the entropy 
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by counting the number of states, there is an implicit assumption that the 
interior system is a mixture of states with equal a priori probabilities i.e it 
is ergodic and is described by a micro canonical ensemble. In ordinary sta- 
tistical systems there is always a "heat bath" one usually invokes - basically 
the environment - that will ensure this, but in the case of the black hole we 
are describing a closed system. There is no environment or heat bath. So 
the question is how does one justify such an assumption of ergodicity for a 
closed system? 

In |l|] this problem was approached using the so called AdS/CFT cor- 
respondence 0, J3|. Using this correspondence the gravitational problem is 
mapped to a Yang-Mills problem. How does a pure state with some fixed 
energy in a Yang-Mills theory "self-thermalize" ? The solution proposed was 
that chaos would do the job. Classical Yang-Mills has already been shown 
to be chaotic 13, 14]. One expects chaos to develop and make the system 
ergodic. Assuming this is true quantum mechanically also, this phenomenon 
could be mapped back to the black-hole via the same AdS/CFT correspon- 
dence. 

While not much is known theoretically about quantum chaos in Yang- 
Mills theories there is some experimental evidence in heavy ion collisions for 
the formation of a quark-gluon plasma (at finite temperature). The initial 
state, which is two heavy nuclei traveling towards each other, is definitely 
described by a quantum mechanical wave function. The final state appears 
to be describable by a system at finite temperature. If this happens, then it 
is quantum mechanical "self thermalization" - a pure state evolves unitarily 
by Hamiltonian evolution and after a while looks like a thermal state. 



1.5 Criterion for a Physical System to Become Ergodic. 

Given that we are able to characterize the nature of eigenfunctions for a 
system that is ergodic, we can ask when does a physical system satisfy these 
properties ? i.e. when do we expect thermalization? We will give an ap- 
proximate answer to this question in this paper. The centre of mass degrees 
of freedom are expected to look thermal most of the times. The interesting 
question concerns the internal degrees of freedom that are normally frozen 
to some discrete quantum states. In the Yang-Mills example above, we ex- 
pect thermalization of quark and gluon degrees of freedom to take place only 
when the energy density exceeds a critical density. The scale is obviously 
set by Aqcd- Similarly in the black hole example we do not expect every 
(zero temperature) neutron star to form a black hole and produce a non-zero 
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Hawking temperature. This is discussed further in sec. 2.6.4. 

1.6 Thermalization Time 

If a physical system is expected to become ergodic, we can ask what is the 
time scale over which this happens. This is also the time scale for return to 
equilibrium when the system is disturbed. It should of course be emphasised 
that depending on how the system is disturbed, there could be many such 
time scales. We will give approximate answers to such questions. 

1.7 Non-Quantum-Mechanical and Quantum-Mechanical Coarse 
Graining 

There have been several attempts to obtain effective thermal density ma- 
trices, or, to obtain irreversibility from reversibility. All these involve (as 
they must indeed) some coarse graining. However, to our knowledge many 
of these involve ad hoc prescriptions that do not arise naturally in quantum 
theory. For example, some invoke averaging over the time of measurement. 
The argument is that every measurement takes a finite time. Nevertheless 
averaging over time alone is not sufficient. As described in Section 1.2 it 
also does not give the right answer. Similarly, some invoke averaging over 
an ensemble of initial states. In quantum mechanics there is no need for 
any other ensemble than that required by the probabilistic interpretation of 
the quantum state. Thus averaging over some distribution of initial states is 
generically ad hoc and unwarranted. The point is that in generic situations 
there is no justification for such ad hoc averaging procedures. Thus what- 
ever coarse graining is necessary must arise naturally within the framework 
of quantum mechanics. We will see that invoking some unobserved "soft" 
(low energy) degrees of freedom can naturally accomplish the required coarse 
graining. An example of such degrees of freedom is the "soft" photons of 
QED. 

1.8 Outline 

In Section 2 we will describe our proposal for Gibbsian coarse graining in 
quantum mechanics using soft quanta. This will address the criticisms of 
section 1.7. We will also give a physical criterion for thermalisation- a qual- 
itative answer to the question of Section 1.5. In Section 3 we introduce the 
mathematical formulation of the problem. We also give a brief description 
of the work of von Neumann and van Kampen on this subject (about which 
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we learnt well after completing our work ). This is included mainly for com- 
pleteness and comparison, and is not logically required for understanding 
our work. In Section 4 we discuss the issues raised in Section 1.2 regarding 
the characteristics of a system that can be described by quantum statistical 
mechanics. In Section 5 we look at the same question from a different per- 
spective - that of soft quanta and resonant transitions. In Section 6 we will 
discuss a two level system coupled to a continuum of soft quanta states. This 
will illustrate some of the ideas more quantitatively. By using the quantum 
mechanical formalism underlying the familiar "Fermi Golden Rule" , we will 
see approximate irreversibility coming out of a reversible dynamics and the 
approximate emergence of a thermal microcanonical density matrix. More 
quantitative answers to the questions of Section 1.5,1.6 and 1.7 will be given 
here. Section 7 will summarize the results of this paper. 

2 Gibbsian Coarse Graining and the Importance 
of Soft Modes 

2.1 Classical Gibbsian Coarse Graining 

An important ingredient in classical statistical mechanics is the notion of 
coarse graining. As was originally argued by Gibbs @, unless the microstates 
are coarse grained, entropy will always remain constant. This is easy to 
see - as the region of phase space ( energy surface) that is occupied by the 
ensemble (call this region T*; care should be taken to distinguish T* from 
r which usually denotes the entire energy surface) spreads out in phase 
space, its shape becomes very complicated but its volume remains fixed. 
The entropy is given by S = J2i Pd n Pi where i labels the microstate and pi 
is the probability that the system is in this state. If microstates are taken 
to be points, then pi = 1 if the point i belongs to T* and zero otherwise. 
It is easy to see that this number is constant because it depends only on 
the volume of P", not the shape. If, on the other hand, the microstates 
are taken to be small boxes(but large enough to have many system points 
within them) in phase space, then pi will be equal to the fraction of the box 
that is inside P*. In this case, it is clear that as T* spreads, most of the 
Pi's will become equal to each other and approach a value between and 1. 
Thus S will increase till it reaches a maximum. 
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2.2 Quantum Mechanical Gibbsian Coarse Graining- "Soft 
Quanta" 

What we need is the quantum mechanical analogue of this coarse grain- 
ing. It is important that one should stay within the formalism of quantum 
mechanics while describing this coarse graining. We propose the following 
scheme: Let us consider our Hilbert space to be a tensor product of two 
Hilbert spaces TLi and TL a i.e H = Tii <8> Tt a - States in Tt are the microstates 
of our system, i are the degrees of freedom that one is physically interested 
in and will represent the "coarse grained" microstates. a represents some 
degrees of freedom that we are not interested in and possibly over which we 
have no control. These are the degrees that allow us to coarse grain. As an 
example consider a gas of molecules, i could be the usual microscopic degrees 
of freedom that one associates with the gas, the positions and momenta of 
the molecules. One can also include rotational or vibrational degrees if one 
wants, a can be soft photons, i.e. the molecules interact with the electro- 
magnetic field all the time and constantly emit and absorb radiation. There 
are very long wavelength photons of almost zero energy that one has no 
control over. They constitute a continuum of gap-less excitations in this 
system. They interact with the molecules but take away negligible energy. 
The energy hyper-surface defining the microcanonical ensemble in quantum 
mechanics always has a small but finite width. We can understand this as 
follows: We can specify the variables i. The a variables are not in our con- 
trol. We can think of i as labeling an energy eigenstate of the i system when 
there is no interaction with the a system. When we turn on interactions 
with a, the state i is no longer an energy eigenstate. It becomes a linear 
combination of states with energy in a range AE. This is an estimate of AE 



in ( 1.2.1 ). The observables of interest O will be assumed to be functions 



only of i and not of a. To be precise we assume that 

(i,a\0\j,b) =5 ab (i\0\j) (2.2.1) 

Thus our coarse graining will be defined by saying that these are the 
operators of interest and it is with respect to these operators that the sys- 
tem looks thermal. Note that this is different from the following kind of 
coarse graining: the observable O depends on both i and a, i.e., (i,a\0\j,b) 



is not necessarily given by 2.2.1. Nevertheless, the a variables are not in 
our control. So we average over them in some fashion - for e.g. Oj, = 
/ dadb(i, a\0\j, b)p at b- This defines a coarse graining. But this is ad hoc and 
definitely is not a quantum mechanical operation. We do not want this type 
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of coarse graining. 

There is some arbitrariness in what we call i and what we call a. But 
it is crucial that the a degrees are gap-less, or at least the gap 5 a between 
two consecutive energy levels of the unperturbed a system should be much 
smaller than the inverse of the time interval during which we observe the 
system. Thus for instance, if Si = for some of the i-states, then some of 
the i degrees of freedom could be called a. Thus if i represents a continuum 
of harmonic oscillators, then some subset of these around zero frequency can 
be called the a variables. Since our energy resolution is always finite, these 
are not observables of interest. 

2.3 Paradigm of Discrete States Coupled to a Continuum 

If it is the case that we can describe the exact system as a coupled i-a system 
with i being discrete and a being continuous, then we are familiar with this 
in atomic physics situations. In this situation if we focus on the discrete 
system alone it will appear to be described by a non-Hermitian Hamiltonian 
with complex energy eigenvalues. The width T represents the finite lifetime 
of the (excited) states. This requires that we take the limit S a — ► 0. If we 
keep 5 a finite then we expect that after a finite time of 0{j-) the system 
will look periodic. Thus the apparent irreversibility is only because the time 
of observation is short compared to j- . Conversely if we are to observe an 
irreversibility, as in the second law of thermodynamics, it is clear that we 
need such soft degrees of freedom. "Soft" in this context means that the 
energy of these quanta should be less than the bandwidth, AE, that defines 
the microcanonical system as : Eq — AE < E < Eq + AE. Thus we want 
Tllo < AE. In all the usual physical systems there are always soft quanta. 
So this is not an issue. 

2.4 Heuristic Criterion for Ergodic Density Matrix 

However the above condition is not sufficient. Arguments in the next section 
suggest that matrix elements between \i) and \j) induced by the interactions 
with a should be much larger than Si. This ensures that an exact energy 
eigenstate contains a large number of different i states. This is not always 
satisfied. Systems that do not satisfy this will not be ergodic. In a quantum 
system with an energy gap (particles in a box) one does not expect any kind 
of ergodicity when the total energy of the system is such that only the lowest 
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energy levels are excited. 



2.5 Resonance and Soft Quanta 

When the energy of the soft quanta is equal to Si, we expect resonant ab- 
sorption/emission to take place. This induces the large off-diagonal matrix 
elements between \i) and \j) states that we referred to above. Because of 
resonance, the coupling between the soft quanta and the i system need not 
be large for this to happen. Furthermore we expect both states \i) and \j) 
to be equally populated if the probability, Pij of \i) — > \j) is equal to Pji. 
This will be the case when the induced emission dominates the spontaneous 
emission, which means that there should be a large number of soft quanta 
available, i.e. the energy/degree of freedom in the a-system should be much 
larger then the largest of the energy gaps Si which is AE. Thus the criterion 
for thermalization is: 

i) there should be a continuum of soft quanta with energies in the range 
Si to AE. This ensures resonant transitions. 

ii) The number of soft quanta in each mode should be >> 1. This ensures 
the equality of the upward and downward transitions and hence equality of 
occupation probability. 

In section 6 we will study a quantum system where some calculations 
can be done perturbatively. The results support the general picture. 

2.6 Examples: 
2.6.1 //2-molecules 

Let us turn now to some physical examples to illustrate these ideas. Consider 
a gas of hydrogen molecules. Clearly, as far as the centre of mass degrees of 
freedom of the molecules are concerned the system is likely to be ergodic. 
This is again a situation where Si = 0. When Si = the off-diagonal elements 
of the full Hamiltonian are clearly larger than Si and one expects ergodic 
behaviour by the above arguments. The centre of mass motion is in any 
case ergodic. 

Let us focus our attention on the relative coordinates of the atoms in 
one molecule and consider this as the i system. We would like to understand 
whether "self thermalization" can take place for this sub-system. We could 
alternatively consider all the molecules, but the interaction between the 

x We remind the reader that we are discussing closed systems and there is no heat bath. 
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internal degrees of freedom being negligible, this will just be many copies of 
the one molecule system. The a states are a subset of the degrees of freedom 
of the rest of the gas molecules that interact with the internal degrees of this 
molecule. We will coarse grain over these degrees by considering observables 
that depend only on i. We assume that the energy exchange between the 
internal degrees of this molecule and that of the rest of the gas molecules 
(which includes the 'a' degrees) is very small - smaller than the resolution 
of our experiment. It therefore makes sense to talk of the micro canonical 
density matrix for the z-system. Let us assume that the kinetic energies of 
the molecules are small and all center of mass degrees can be included in 
the a-system. 

We assume that the potential energy and the energy levels of this system 
are as shown in figure 1. The width of the dashed line represents the energy 
resolution AE that defines a microcanonical system. The energy of the soft 
'a' quanta has to be less than AE 

If the energy of the system described by the Hamiltonian for relative 
motion is in the region shown by the dashed line marked "a", where 5i is 
fairly large compared to AE we do not expect ergodic behaviour. We expect 
that if we start the molecule off in a pure state, such as the ground state or 
first excited state, it will remain there. This follows from purely energetic 
considerations. 

The region marked "c" is a continuum where the molecule has dissociated 
and we have atomic Hydrogen. In this region <5, ~ << AE and one expects 
ergodicity. 

Region marked "b" is in between, with 5{ ~ AE and one can expect 
ergodicity if the off diagonal matrix elements are large compared to <5j. As 
long as there are soft quanta that can be absorbed or emitted whose energies 
match the energy spacings (<5j), this will be the case. The soft quanta are 
provided by the inelastic collisions with other molecules. We thus expect 
that even if we start the system off in a pure state it will soon make tran- 
sitions to the numerous other states in that energy band and be effectively 
described by a microcanonical density matrix. This is what we would like 
to demonstrate in Sections 4,5 and 6. 

As the above example illustrates, the centre of mass degrees in a gas, 
for instance, typically are expected to be ergodic. They correspond to the 

2 In a realistic system it may be that these conditions are satisfied at such low "temper- 
atures" that the f/2-gas may have liquefied. But these considerations can apply for any 
state of matter. 
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Figure 1: In the energy range "a" the system is unlikely to be ergodic. In 
"c" it is very likely ergodic, and in "b" it is almost ergodic. The width of 
the dashed line represents AE 
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5i = case. Classical analysis of "billiard balls" in certain situation shows 
ergodicity even with a small number of balls |J. We need not expect the 
quantum behaviour of this weakly interacting system to be very different. 

The difficult question concerns the non-center-of-mass degrees that have 
non-zero <5j. Are they ergodic or not? It can happen sometimes that while 
the center-of-mass degrees are ergodic, the non-center-of-mass degrees are 
in a pure state. The point we are trying to make is that the value of the 
energy spacing 5i, is a crucial factor in deciding the answer. 

2.6.2 QCD Plasma 

Another example is that of heavy ion collision producing quark gluon plasma 
Clearly when the energy of collision is smaller than the energy gap in 
QCD the hadrons are still usefully described by wave functions. Once this 
gap is exceeded one can expect thermalisation. What are the soft quanta 
here? Presumably these are soft gluons. There are some points of simi- 
larity between this system and the hydrogen molecule example discussed 
above. Deconfinement here, presumably corresponds to the dissociation of 
the molecule there. 

2.6.3 Sodium Vapour 

Let us take another example, where again we expect to see two different kinds 
of ergodicity. As it turns out this example has some similarity with the black 
hole situation to be discussed later. Consider a vapour of sodium atoms. 
The centre of mass motion of the atoms is "gap-less" and can be expected 
to be ergodic motion, even at low temperatures. The electrons in the atom 
however are described by pure state wave functions, at temperatures that 
are low compared to electronic level spacings. Consider what happens when 
the vapour cools down sufficiently that it forms a Sodium crystal, at some 
low temperature. The centre of mass motion is no longer ergodic. However 
sodium being a metal, the outer electrons are free to move around in the 
crystal. They are now described by a Fermi gas and their motion is ergodic! 
One can presumably associate a temperature with them. The gap <5j between 
electronic levels has changed from, say, a few electron volts to something of 
the order where L is the macroscopic size of the crystal. This is a quantum 
mechanical effect. Thus a phase transition in the material has introduced 
new gap-less excitations that demonstrate ergodic behaviour, even though 
at a higher temperature they were not ergodic (because they were confined 
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to the interior of the sodium atom)! 
2.6.4 Stars and Black Holes 

There is a similarity between the sodium vapour example and the black hole 
example. When matter collapses to form a star the centre of mass motion 
of the lumps of matter once again can be assumed to be ergodic and as a 
consequence we get a hot star. (Indeed astrophysicists routinely treat this 
motion entirely classically using classical statistical mechanics, until we get 
to high densities.) At some point the star (if sufficiently massive) cools 
to essentially zero temperature, undergoes a phase transition and forms a 
black hole. At this point some other internal degrees of freedom (D-branes 
in string theory black holes) become (presumably gap-less and) ergodic and 
there is a temperature that can be associated with them. These are like the 
outer electrons of the sodium atom. 

2.7 Summary 

Let us summarize the points made in this section: 

1. Coarse graining is done by introducing soft quanta a in addition to 
the conventional microscopic degrees i. These soft modes induce transitions 
between the The gap 5 a must be very small compared to the inverse time 
of observation. Also the off diagonal terms in the Hamiltonian connecting 
different i states must be much larger than the unperturbed gap Si. This is 
necessary for the exact energy eigenstates to satisfy the properties assumed 
in Section 3. This is thus necessary for ergodicity. In the resonance picture 
this will automatically be the case if there is a sufficient number of soft 
quanta with a continuum of frequencies in the appropriate energy range. 

2. The interaction with the continuum a degrees of freedom, induces 
an anti-Hermitean part to the effective Hamiltonian of the i system. This 
makes the evolution look irreversible. The apparent lack of reversibility has 
to do with treating the a-degrees as continuous. If a gap S a is introduced, 
the recurrence time is O(j-). 

In section 6 we will calculate, in a simplified model, the time evolution 
of the partially traced density matrix that describes the i system and show 
how it appears to evolve in time to a thermal one. 
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3 Mathematical Formulation of the Problem and 
Earlier Approaches 



In this section we give a mathematical formulation of the problem we wish 
to address in this paper. It is helpful to start with a brief recapitulation of 
the situation in classical statistical mechanics. In particular we shall follow 
the Gibbsian route formulated on the classical phase space and even there 
we will be concerned only with the micro canonical ensemble description. 

In such a situation one considers an isolated system with its total energy 
in a narrow band Eq — A < E < Eq + A. Under time evolution governed 
by the dynamical equations the system moves on this hyper-surface of the 
6iV-dimensional phase space called the "energy surface". Different regions 
of this energy surface can be labeled by values of observables other than 
the Hamiltonian. Generically these observables are time-dependent. Fur- 
ther, the accuracies with which these observables can be determined make 
it meaningful to decompose the energy surface into elementary cells called 
"phase cells" and all system points within the same cell are ascribed the 
same value for the "coarse grained" observables Due to dynamics the sys- 
tem point moves from phase space cell to phase space cell. The crux of the 
Gibbsian statistical mechanics is the so called ergodic theorem ( more pre- 
cisely the quasi-ergodic theorem) which states that after a sufficiently long 
time the system point passes through all the phase cells and furthermore the 
time spent in each phase cell is proportional to its volume. This immedi- 
ately allows one to equate the time average of observables with an " ensemble 
average" wherein the weight factor for each phase cell is its volume. As the 
phase cells can be constructed with equal volume the ensemble average can 
be taken with a distribution giving equal weights to the different phase cells. 
This is the microcanonical distribution. The crucial point is that the time 
average has been replaced by an ensemble average in such a way that the dis- 
tribution is independent of the initial state. It should be emphasized that in 
addition to the quasi-ergodic theorem one also approaches the same problem 
through the so called H-theorem and the concomitant concept of entropy.In 
what follows we shall only concentrate on the quasi-ergodic aspects of the 
problem. 

Having stated all this it is very important to emphasize that a precise 
proof of the quasi-ergodic theorem and in particular the resolution of the pre- 
cise role of dynamics is a very difficult problem. Both in this formulation as 
well as in the Boltzmannian approach to statistical mechanics assumptions 
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have to be made that are equivalent to the assumption of chaotic behaviour. 

3.1 Formulation of Quantum Statistical Mechanics. 

Now the main question is how to formulate and prove similar statements 
in the context of quantum mechanics. The foremost difficulty here is that 
unlike as in classical mechanics there is no concept of a phase space nor of a 
trajectory.If classically one views the phase space as the space of all possible 
states of the classical system, the natural analog in the quantum case is the 
Hilbert space. Already at this stage many crucial differences appear; one 
such is the fact that for even a simple system like the ideal gas while the 
classical phase space is finite- dimensional with dimension 6iV, the quantum 
mechanical Hilbert space H is the tensor product of N copies of Hilbert 
spaces Hi each of which is infinite- dimensional. 

Let H be the exact Hamiltonian of the system and let \A) be the exact 
eigenstate with eigenvalue E A - Now Consider energy eigenstates such that 
their eigenvalues are in the range Eq — A < Ea < Eq + A. Further, let the 
initial state of the system \ip) be such that 

W) = J2ci\A) (3.1.1) 

A 

Then 

\m) = E C t e~ tEAt \A) (3.1.2) 
A 

If we denote C^(i) = C^e~ lEAt it is obvious that |C,i(t)| 2 are independent 

of time for all A. The average energy of the system at any time t is given 
by • . • . 

E(t) =Y / \C A (t)\ 2 E A (3.1.3) 

A 

^From eqn ( |3.1.3| ) we see another principal difference between the classical 
and quantum situations: in the quantum case not only is E constant in 
time, but so are all the quantities | | 2 . If we now decompose C A (t) into 
its magnitude ta and its phase 9a we find that while ta does not change 
with time, 6 A {t) = 9 A (0) - E A t. 

^From this it follows that under quantum dynamics i.e Schroedinger 
equation, the motion of the system point is not over the entire Hilbert space 
but over the subspace defined by constant |C j 4(i)| 2 for each A. The mo- 
tion is in fact on the iV^-torus spanned by the angle variables 6a where N A 
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is the number of (non-degenerate) energy eigenvalues in the interval consid- 
ered. Furthermore, the motion on this iV^-torus is such that the angular 
velocity is constant in every direction and equal to —E A . For a macro- 
scopic system the Hamiltonian H will be generically so complex that this 
motion will densely fill the entire iV^-torus and because of the uniform ve- 
locity in every direction the time spent by the system point in any interval 
[{$a}j{#A + ^#a}] is exactly proportional to the volume of the interval. In 
this sense the quantum motion is quasi-ergodic. The precise conditions to 
be fulfilled by the spectrum of eigenvalues of H will be discussed later but 
it suffices to stress here that they are fairly generic and unlike the classi- 
cal case do not require special assumptions about chaos. Now consider the 
expectation value of an observable O in the state \ip(t)): 

(m\0\m) = E C b Ct e-^~ E ^ BA (3.1.4) 

A,B 

and a time-average of this expectation value over a duration T centred at 
time r 

o{r) = ± m)\A\m) 

= E C A O ba " y (3.1.5) 

%~X ~ E B )1 

Now the main problem is that this expression has an explicit dependence on 
the parameters c\ of the initial state and can not be replaced by an ensemble 
average that is insensitive to the initial state. This is the mathematical 
formulation of the problem to be solved i.e interpret quantum statistical 
mechanics in such a way that the time-average does not remember the initial 
state. 

It is necessary to make more precise the notion of the time-average. Note 
that 

The time average in eqn (|3.1.5[) then becomes 

0(r) = J2\Ci\ 2 O AA . (3.1.7) 

A 

At this stage nothing more can be said. If we however consider some spe- 
cial class of systems it is possible to make more statements. For example, 
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Berry [32] has conjectured that for classically chaotic systems energy eigen- 
functions behave like Gaussian random variables. More precisely consider 
expanding the exact energy eigenstates \A) in some orthonormal basis \I) 

\A) = J2C I A \I) (3.1.8) 
i 

Then the Berry conjecture amounts to saying that C A are independently 
distributed random numbers. At this stage it is sufficient to just specify the 
two-point correlation of this distribution 

(C%C J B ) ens = -^5 A b5ij (3.1.9) 

IN a 

It is easy to see that eqn ( |3.1.9 ) is compatible with unitarity. Despite the ap- 
parent basis dependence of this criterion, it is actually basis independent.To 
see this let us expand \A) in another orthonormal basis I' 

\A)=j2C I A\l') (3.1.10) 

with 

\I')=Bj'\I) (3.1.11) 

Then 

Ci'=5>f'ci (3.1.12) 



and 



K^A L'B/ens — / , -"J &j {L. A ^ 



*I ,~iJ \ 

Bl ens 

IJ 

^^BfBj'SAsSu 
^ A u 



= -^t^ab^i'j' (3.1.13) 

where we have made use of the unitarity of the transformation matrix Bj' . It 
should be noted that this proof works only when the bases I, I' etc. are not 
derived from the energy basis by application of fixed unitary transformations 
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on the energy-eigenfunction basis. Now let us consider the ensemble-average 
of eqn ( gT7|) : 

\Ca\ 2 Oaa) ens — E] | | 2 (Cj^ C^) ens Oj J 
A AIJ 

- j- A ZO„ (3.1.14) 

Let us also consider the average of O at a particular instant given by eqn 
(1X1): 

(Y,Ctciz- i{EA ~ EB)t OBA)ens = E ^ C^C*/ C J B ) ^ je~^- E ^ 
AB ABU 

= W- E C^fctSABSue-^-^Ou 

IVA ABU 

= fE«o„ 

^ A AI 

= (3.1.15) 

which is the same as eqn ( |3.1.14j ). Thus in these cases the time-average is 
equal to the value at any given instant of time once the ensemble average 
is taken and in both of them all memory of the initial state is lost. The 
drawback with this picture is that the equality seems to hold at arbitrary 
times t while one should only expect it at late times. In other words there 
is no way to understand " thermalization times" in this picture. 

For eqn ( |3.1.6| ) to hold it actually suffices for T > > i where 5 is the spac- 
ing between the exact energy eigenvalues. As this spacing typically decreases 
exponentially with the size of the system the corresponding T is exponen- 
tially large in system size. This is analogous to the classical case where for 
very very large times the Poincare recurrence theorem |^/ guarantees the 
equality of time and ensemble averages. 

Clearly what's important are not such astronomically large time-scales. 
Instead, consider the more moderate and intermediate time-scale 
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< T « i: then one has 

sm.(E m — E n )T 
(E m — E n )T 

and consequently 

O(r) = £C? Cj e-^-^^^ (3.1.16) 

Again if the spectrum of the Hamiltonian is sufficiently complex and there 
are no degeneracies, for any reasonable value of r there will be cancelations 



among all the A ^ B terms and one again gets back to eqn ( 3.1.7 ). Actually 
for such complex systems even the average at any particular instant (except 
for pathologically small values of r) becomes the same as eqn ( FTPD and 



one does not even need to do any time averaging. But eqn ( 3.1.7[ ) retains 



memory of the initial state and at this stage it is not possible to recover the 
basis of statistical mechanics unless one postulates additional assumptions 
like Berry's conjecture etc. (we will show later ways of going beyond such 
restrictive assumptions; that will be the main result of this paper). 

In [31] there was a proposal to consider a different approach to this 



problem by assuming that observables of interest are of the type: 

Oba = O 5 BA + Rba (3.1.17) 

with Rba being "small". 

Now too (ip(t)\0\ip(t)) is of the same form as eqn(3.1.7) except for small 



corrections due to the Rba terms. Once again if we interpret this restriction 
in the Schrodinger picture one would get eqn Q3.1.7 ) to be true at all times 



which does not make much sense. A possibility is to interpret (ip(t)\0\i[)(t)) 
as (ip\0(t)\ip) in the Heisenberg picture and assume eqn (|3.1.17|) only for 
late times. Even then one has to still introduce some "ensemble average" 
along the lines of Berry's conjecture and there seems to be no scope for 
addressing the issue of "thermalization time" at all. We will see later that 



eqn ( 3.1.17 ) is too strong a restriction on macroscopic observables and in 
fact implies that in the classical limit these observables are constant on the 
entire energy surface. 

3.2 Earlier approaches 

Much after we had finished our work( to be described in detail in sees 4, 5 
and 6) we came to know of the fundamental paper on this subject due to J. 
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von Neumann [|T(]] through ref [22]. Subsequently we have traced Pauli and 
Fierz's [11] sequel to von Neumann's work as well as Van Kampen's formula- 
tion |12| of the problem. There are differences between von Neumann's and 
Van Kampen's approaches but they are essentially similar in spirit and both 
give a quantum mechanical equivalent of the classical phase space cell de- 
composition. While von Neumann uses fairly rigorous methods to show that 
the time average of the expectation values of the macroscopic observables 
(defined suitably by him) approaches arbitrarily closely the microcanonical 
average ( he also shows that a suitably defined entropy, different from what 
is known as von Neumann entropy in the literature, also approaches the en- 
tropy of the microcanonical distribution which is the quantum version of the 
H-theorem) . Van Kampen establishes a master equation for the probability 
distribution for finding the system in various phase cells given the distribu- 
tion at t = and argues that it has all the properties of a Markov process 
whence the problem becomes identical to the one in classical statistical me- 
chanics. While in von Neumann's treatment the issue of thermalization time 
is not very transparent, in Van Kampen's treatment it can be handled in 
principle exactly as in classical statistical mechanics. 

While the mathematical formalism in all three approaches have strong 
overlaps, our treatment is rather different in its interpretation of coarse 
graining in quantum mechanics which is somewhat kinematical in origin in 
both von Neumann's and Van Kampen's treatments. We seek a dynamical 
origin for this coarse-graining. In our approach, the extra soft quanta not 
only play a passive role as the degrees that are coarse grained away, they 
directly provide the mechanism for thermalization. Thermalization takes 
place due to emission and absorption of these soft quanta. Furthermore 
in order for this thermalization to happen it is crucial that they have the 
properties described in Sec 2.4 and 2.5. Thus the reader will see that the 
setup of Section 4, where the solution to the problem of ergodicity is given 
in an abstract way is, perhaps not surprisingly, mathematically identical to 
the setup of von Neumann. Both of them describe a quantum mechanical 
analogue of Gibbsian coarse graining. Section 5 and Section 6 where the soft 
quanta play out their role in thermalization have, no obvious counterpart in 
von Neumann's discussion. 

We include a short description of the works of von Neumann and van 
Kampen for reasons of completeness and historical accuracy. This should 
also provide a perspective for the whole discussion. This section is not 
a prerequisite for understanding the rest of this paper. For convenience 
we have given at appropriate places below our notation used in Section 4 
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corresponding to each of von Neumann's symbols. 



3.2.1 Quantum Statistical Mechanics of von Neumann 

The three essential ingredients of this approach are i)energy-surfaces,ii)phase 
cells belonging to a particular energy surface and iii)micro-states spanning 
each phase cell. 

Energy Surfaces:The exact Hamiltonian of the system is taken to be 
H whose spectrum is assumed to be discrete for simplicity.If AE is the 
macroscopic resolution with which energy measurements can be made, the 
energy levels are divided into groups of width AE each and the groups 
are labeled by a = 1,2,.... This means that only energy levels belonging 
to different groups are macroscopically distinct. The energy eigenvalues are 
labeled W p>a with p = 1,2, ...,S a . (In Section 4 we have used A to label 
the exact energy eigenstates. S a corresponds to Na of Section 4, where 
we have restricted ourselves to one energy surface.) The eigenfunctions are 
labeled (pp,a- Thus S a is the number of microscopic states spanning the 
energy surface a. The projection operator corresponding to cj) p ^ a is denoted 



Macroscopic Observables: von Neumann considers all macroscopic 
observations to be simultaneous and therefore all macroscopic observables 
to be mutually commuting. (The word "macroscopic" is used here in the 
sense of Quantum Measurement Theory. It does not mean macroscopic 
in the sense of statistical mechanics (i.e. pressure, density, etc ). Indeed 
they correspond to micro-state variables such as the position coordinates of 
molecules of a gas except that macroscopic measurements can not resolve 
their values within a phase cell. These observables are thus a subset of the 
O of Section 4). Their simultaneous eigenfunctions are denoted by uj\ p , A = 
1,2, ..,s p where distinct values of p denote distinct values of the observables 
but all states with the same p but different A have the same values for all 
observables. (In Section 4 we have used 'a' for A, and for p. s p is N a of 
Section 4. The simultaneous eigenstates there are denoted by \i,a).) 

The projection operator for the state uj\ jP is denoted by P^ A and the 
density matrix signifying an equal mixture of tu\ iP for the same p but for all 
possible values of A is denoted by 



byP 0: 



p. a 




(3.2.18) 



Eqn ( p. 2. 18] ) can be taken as the density matrix for a " phase space cell" . In 
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von Neumann's paper the explicit construction of these mutually commuting 
macroscopic observables is not given and the discussion not very transpar- 
ent. Van Kampen treats this in a more transparent manner. Consider any 
observable O and express it in the exact energy-eigenfunction basis 

0= £ O mb \4> p , a ){^ b \ (3.2.19) 

p,a,a,b 

Van Kampen argues that the matrix elements of the time-dependent oper- 
ator 0(t) 

0(t) mb = O mb e-^- w ^ (3.2.20) 

are associated with very rapid fluctuations whenever a ^ b. Thus it makes 
sense to replace the microscopic operator O by 

0'= E O pm \^ a ){^ a \ (3.2.21) 

Equivalently 

Op,a;a,b = O p ^;a,a Kb (3.2.22) 

Van Kampen also introduces the coarse-grained energy operator to be 

Sa 

h' = E W «E I V« ( 3 - 2 - 23 ) 

a p 

Note that all the energy eigenvalues belonging to an energy-surface have 
been replaced by a single value W a . Now it is easy to see that H', O' com- 
mute so they can be simultaneously diagonalised: 

H'\u^ a ) = W a \u^ a ) 

0'\u^ a ) = A^u^a) (3.2.24) 

He now introduces the coarse-grained observables by grouping the eigenval- 
ues A^a into phase cells in such a way that only those belonging to different 
groups(cells) can be distinguished macroscopically i.e through macroscopic 
measurements. Let these groupings be labeled by v = 1,2, ...N a (N a of von 
Neumann is Ni of Section 4). Then 

o = EE^E p ^,. ( 3 - 2 - 25 ) 

a u A 
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Now it is clear that all such macroscopic observables commute with each 
other with uj\ v a as their simultaneous eigenfunctions. Furthermore, in this 
approach, the phase cells, labeled by v, a are neatly partitioned into disjoint 
sets by the energy surfaces. 

It is instructive to compare this construction with von Neumann's treat- 
ment ( and later with ours), von Neumann argues that any operator with a 
boolean spectrum( eigenvalues or 1) is a macroscopic observable and every 
macroscopic observable has a spectral decomposition of the type 

A = c P E p (3.2.26) 

p 

He then considers the function f a (x) such that f a = 1 if x belongs to the 
energy eigenvalues of the energy surface labeled by a and otherwise. Then 
the operator f a (H) has eigenvalues 0, 1 and is thus a macroscopic observable. 
By eqn (|3.2.26| ) it must admit the decomposition 

= £# E p (3-2-27) 

v 

On the other hand 

Sa 

/ a (#) = E P ^ (3-2.28) 

p=l 

As both f a (H) and E p for every p are equal to their own squares and as the 
product of two distinct E p 's is zero, it follows that ff = fP for every pi.e 
f£ is either 1 or 0. Relabeling all those p's for which = 1 by is, a we have 

Sa N a 
P V 

i.e we again have a unique partitioning of all the phase cells among the 
energy surfaces. The equivalent of this in our approach is given by the 
construction in sec. 4.1. 

Microcanonical ensemble: 
We briefly sketch JvN's proof showing the asymptotic equivalence of the time 
averaged expectation value and the microcanonical average. We denote the 
operator occurring in eqn (|3.2.29 ) by A a . It is then clear that ^-A a is 



the density matrix for an equal mixture of energy eigenstates belonging 
to the energy surface a and as such represents the density matrix for the 
microcanonical ensemble associated with that energy surface. If the initial 
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state ip is such that (A a ip, ip) is nonzero for several values of a, one should 
consider a mixture of microcanonical ensembles for each a with weight factor 
(A a ^,^)(this is the probability of finding the system in the energy surface 
a). Thus the microcanonical density matrix in the general case is 

U* = ± { -^^A a (3.2.30) 
0=1 ba 

In our treatment later on we shall consider only one energy surface. 
Quantum Ergodicity 

Next JvN considers the system to be initially in a state 

OO S a 

W = EE r p , a e ia ^ \^ a ) (r p , a > 0, < a p , a < 2tt) (3.2.31) 

a=l p=l 

then 

OO S a 

M*)> = E E r p , a e- i ^ w ^ t - a "^ (3.2.32) 
0=1 p=i 

Introducing the abbreviations 

x v ,a = (E^(t), VW) u a = (A ^(t), m) = (A a V>, (3-2.33) 
one considers a macroscopic observable ^4 

00 AT a 

A = E E ^.« E ",« ( 3 - 2 - 34 ) 
a=l i/=l 

The expectation value of A in the state ip(t) is then 

00 N a 00 

^(^(t)) = (AmMt)) = EEvMW.#)) = E E 

a=l i/=l a=l u=N a 

(3.2.35) 

while the microcanonical average is 

m ft 00 AT Q / 

a=l i/=l a '=l i/'=l a ' 

OO N a 

= EEV^ (3-2.36) 

a=l ,/=l ° a 
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On using the eqn ( |3.2.36| ) and Schwarz's inequality JvN finally obtains 

(E A (m) - E A mf < ff £ E — [^,- - (3-2-37) 
where 

f = EE (3.2.38) 

a=l !/=! D a 



JvN finally establishes that the time average of the lhs of eqn ( 3.2.37 ) is 

Sa 



bounded by r/ 2 max a {^^-} 



3.2.2 Van Kampen's approach 

Van Kampen expands the initial state ip not in terms of the exact eigenfunc- 
tions of the Hamiltonian but in terms of the simultaneous eigenstates oo\ iUA 
of the coarse grained observables( we are retaining the notation of JvN): 

\i>) = E & V,>A,,, a ) (3.2.39) 

\,i/,a 

The probability of finding the system in the phase cell u, a is given by 

Sis, a 

Pu,a = E l 6 ^,a| 2 (3-2.40) 
A=l 

The time dependence of the state is given by 

\m) = E (3-2-41) 

where 

b\,v,a(t)= E (^,u,a\U(t)\u X 'y,a') (3.2.42) 
\',v',a' 

and U(t) is the time-evolution operator. The probability of finding the 
system in the phase cell v, a at time t is given by 



Pv,a(t) = E ^{^X,u,a\U(t)\uJyy, a ')(^X,u,a\U(t)\uJy>y>y')*byyy(0)b* x// y 

(3.2.43) 



\'y, a' A 

A", v", a" 
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Invoking the argument that the summation consists of many wildly fluctu- 
ating terms and all the non-negative terms cancel i.e only surviving terms 
are v' ,a' = u", a" and A' = A". This reduces eqn(3.2.41) to 



(3.2.44) 



v',a' A' A 

Van Kampen further argues that in a sum of type 

E q *a 



where both a{,f3i are rapidly fluctuating but positive it is a good approxi- 
mation to evaluate the sum as 



With this approximation eqn ( |3.2.44j ) becomes 

Pu,a(t) = E — E \("\,v,a\U(t)\uX>y,a>)\ 2 Pv>,aiO) 



v\al "< a A, A' 
With the definition 



T t {y, a\v' ,a ) d = — ^ \(u\, u ,a\U(t)\uJx>y,a>)\ 2 



A, A' 



eqn (3.2.45) can be recast as 

Pv,a(t) = E T t {v,a\v',a')P vl y{$) 



(3.2.45) 



(3.2.46) 



(3.2.47) 



As there is nothing special about the instant t = it follows that 

Pv^h + h) = T t2 {v,a\v',a')P v ,y{t x ) (3.2.48) 
v'y 

Equivalently 



T tl+t2 (v,a\u',a') = ^ T t2 (v,a\v" ,a")T tl (v" ,a"\v' ,a!) (3.2.49) 
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Along with the obvious properties of T t {v, a\v' , a') 

T (u,a\u',a') = S Vja y >a r, T t (v,a\v',a') > 0; ^ T t {v, a\u', a) = 1 

u,a 

(3.2.50) 

which follow from the orthogonality of \oo\ iU ,a)-, \^n,v',a') for distinct {u, o}, \y' , a'} 
and the unitarity of U(t), one concludes that T 4 is a Markov Process. One 
knows from the Frobenius-Perron theorem that there is always an equilib- 
rium distribution Pfj? a . From the spectrum of the operator T(v, a\v' , a') one 
can also deduce the thermalization times. 

Before concluding this section it is instructive to derive the so called 
Master equation. For this one solves for T^t as 

T At (v,a\u',a') = 5 VA -y^{l - At ^ WV',a'V,a} + AiW^^/y (3.2.51) 

v", a" 

Then one gets the differential form of the Chapman-Kolmogorov equation 
d 

—T t (i/,a\u',a) = Y W Uja .y fja »T t (v" ,a"\v' ,a') - W u ^ all . v a T t (u, a\u',a") 

0t u",a" 

(3.2.52) 

This can be recast in terms of P u , a (t) a s 

J t Pu,a{t) = Y,i W ^y,a'P,',a' ~ W^, a / ; „, a P v ,a(t)} (3.2.53) 
u',a' 

which is the master equation. 

4 Quantum coarse graining and emergence of the 
microcanonical ensemble. 

As we have stated already the major conceptual problem in Quantum sta- 
tistical mechanics is that of understanding how a quantum state initially 
described by a pure density matrix can ever resemble the state of thermo- 
dynamic equilibrium which is clearly described by a mixed density matrix. 

4.1 Coarse graining 

As was clearly stressed by Gibbs Q long ago, coarse graining is essential to 
understanding even the most basic features of classical statistical mechanics. 
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We now introduce our proposal for a quantum coarse graining and show 
subsequently that the microcanonical distribution then follows for a large 
class of systems. 

We consider two types of mutually commuting variables /, A and their 
simultaneous eigenstates \i, a). Later on we will define \i,a) only in the 
subspace spanning the states relevant for the microcanonical ensemble. The 
i will eventually correspond to the labels of the usual micro-states entering 
the description of the statistical system and the precise nature of a are not 
specified at present. Alternately one can think of \i, a) as the labeling of 
micro-states on the fine scale and \i) as the labeling of the micro-states at 
the coarse grained level. 

It is assumed that to a very good approximation the A variables couple 
weakly to the /-variables in the total Hamiltonian i.e 

H tot = H (i) + H (a) + X H (i ' a) (4.1.1) 

with A very small. Let \A*) be the eigenstates of that lie in the range 

Eq - AE < E + A. We then form the matrix I* AB = (B*\I\A*) which is 
Ni x N{. Now let \i) be the eigenstates of /* and let \i, a) = \i) <8) \a) where 
| a) are the eigenstates of A and let us also assume that there are N a such 
states. (A special choice could be I = in which case \i) = \A*).) 

The important point is that the N{ x N a states \i,a) continue to form a 
basis in terms of which all the exact energy eigenstates in the microcanonical 
band can still be expanded. It is of course important to state that even a 
small perturbation characterized by A can completely alter the nature of the 
eigenstates of the Hamiltonian H because of the exponential crowding of the 
"unperturbed" states but the number of eigenstates remains unchanged. The 
parameters of the microcanonical band Eq,A are also most likely affected 
by the perturbation, but as we shall never need their precise values it does 
not matter for the ensuing discussion. 

We shall only consider those observables that are insensitive to the a 
labels i.e (i,a\0\j,b) = Oij5 a b- This is in keeping with the spirit of coarse 
graining described in Section 2.2. 

4.2 Some preliminaries 

Consider some arbitrary state in the subspace we have considered 

\^)=J2ci\A) (4.2.2) 
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where now \A) are the exact energy eigenstates of the system. Further, let 

\A) = E °A a M (4-2.3) 

i,a 

with the inverse expansion 

|i,a>=£ C'a \A) (4.2.4) 

A 

The coefficients C™ satisfy the following unitarity conditions: 

^ A C A = Oii'Oaa' Z^^A^A' = °AA' (4.2.5) 

A ia 

In terms of these definitions we can rewrite 

= E C b Ct (B\0\A) (4.2.6) 

A,B 

as 

CW = E E c? ^ c A a qfo« ( 4 - 2 -7) 

Now introduce 

^C^C^ a = <5 jfe Sab P A + R% (4-2.8) 

a 

This is to be understood as a diagonal + off-diagonal split in the sense that 
by definition 

Pa = Y.C J A b Cf (4-2.9) 

b 

Consequently 

Rf A = (4.2.10) 

(No sum on indices.) We need to prove a few important properties of P and 
R. Let A B; putting j = k and summing over j one gets 

E R AB = (4.2.11) 

k 

Therefore 

E^b = (4.2.12) 

k 
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always. Further, 



Pa =1 and also ^P\ = N a (4.2.13) 

k A 



as follow from the unitarity eqn (4.2.5). Likewise by putting A = B and 
summing over A one gets 

E R AA = (4-2-14) 

A 

4.3 Emergence of the microcanonical distribution 

Let us consider a pure quantum state which at t = is given by 

\t = 0) = = £ O* \A) (4.3.15) 

A 

Now it is straightforward to show that 

m)P\m) = E c a c t p i E c^cj^c^-^flW 0iJfc 

A,j ABjfc 

(4.3.16) 

Separating out the A = B contribution in the second term one finds 

(m\o\m) = E c# o,,+e <#cj£/&o iJfc 

A,j A j A 

+ E E C ^ C A E-Wa-W R% O jk (4.3.17) 

A+B jk 

We can further exploit the redundancy introduced by a states by defining 
an equivalence class of states as follows: define the unitary operator U a by 

U a \a) = Y,Kb\b) (4.3.18) 

b 

which is some unitary transformation on \a) space. Clearly it does not affect 
the expectation values of our observables of interest, O, since O is insensitive 
to a. Thus if 

u a \m) = \r(t)) (4.3.19) 

we have: 

(r(t)\o\r(t)) = (m\o\m) (4.3.20) 



32 



Consequently 

m)\o\m) = j^r E(^(*)ioiv Q («)) (4-3.21) 

Here N a is the number of such U matrices. This is up to us and we can 
choose these matrices to satisfy 

^USMd = tfabtcd (4.3.22) 

a 

If we multiply by 5 a b and sum over a, b, we get, using the unitarity of the 
matrix, 

a 

This gives N a = fxN a . The conditions ([4.3.22; ) are conditions on N a N^ 
variables. Thus we need N a = N% at least. So we take N a = N% which 
gives fj, = N a . 

It should be noted that eqn ( 4.3.21 ) is valid only at a particular instant 
t as it can be valid for all t only if U commutes with the microscopic Hamil- 
tonian H. We can think of \ip a (t)) as time evolved (after t) from some state 
\ip a ). In other words 

|^«(t)) = e- iHt \i) a ) (4.3.23) 

Again eqn ( |4.3.22| ) is only valid for time t. Actually \ip a ) explicitly depends 
on t unless [U, H] = 0. Let 

ir)= J2d a \A) (4.3.24) 

A 

Then eqn( 4,3.2l|) implies 

]T Cfe- iEAt \A) =U a Y J C%e- iEBt \B) (4.3.25) 
A B 

Fromthis it follows that 

<jf = e< EA - EB ^(A\U a \B)ci (4.3.26) 

B 

Let us evaluate (A\U a \B) now: 

(A\U a \B) = J2C*J b C%(ia\U a \jb) 

ijab 

= Y, C *B ib C'AU* a (4.3.27) 

iab 
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Thus 

<jf = e< EA - EB *C* B ia / C%u£ a C B ' (4.3.28) 

Biaa' 

On introducing 

lV>(*)>=£ D i(*)|w> (4-3-29) 

Mi 

it is easy to see that 

Df a (t) = J2 C *B a Cte- lEBt (4.3.30) 

B 

hence 

<?f = E e^'^i(t)£^« (4.3.31) 

iaa' 

We now use all this in equation ( [4.3.21 ). 

-^J2(r(t)\o\r(t)) = ±- E cpcjgoue***-*** 

(4.3.32) 

On using eqn ( |4.3.31[ ) 

ol ijacbd ot 

ijac 

= M E D tc D jt [ P A^AB^ij + R%b\ (4-3.33) 
ijc 

where we have made use of eqn( pLT3C| ) and eqn( pL2l| . Plug this into ( |4.3.32|) 
to get 

£ 0t(t)Dg{t) [P\8 AB 8 ij + I?i B ){PX8 AB 8 kl + R* A ]O kl (4.3.34) 

a ijklcAB 



We expand to get 
1 



a A 

+ ^ E DtDf c Rl B R%O kl (4.3.35) 
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Summation over all other indices as well as the time dependence of Df's is 
understood. 

If P A are such that there is a very weak dependence on either j or A, we 
can draw the following additional conclusions: 

P J A = ^ (4-3.36) 



This follows from eqn( 4.2.13 ) 



The following is one of the ways of realising eqn (4.3.36): C A C A a is the 



overlap between \i,a) and \A) states. If a state \i, a) has equal amounts 
of |^4) states then C A C A a will be more or less independent of A. This is 
equivalent to saying that the perturbation H l,a "thoroughly mixes" up the 
eigenstates. Of course in general if we plot C A C A a as a function of A, while 
we expect it to be independent of A on the average over a range, we also 
expect the plot to be full of spikes and dips. The sum over a smoothes out 
these fluctuations and should produce a smooth constant function. This is 
similar in spirit to the Berry conjecture though much weaker than it. 
On noting that YIa-^aa = 0> the final result for (ip(t)\0\i()(t)) is 



1 3 a i,j,k,l,c,A?B 

(4.3.37) 

We have used the fact that the sum over A gives a factor of N a Ni in the 
first term. Note that the time dependent part has an explicit j^- in front. 

It is imperative to show that the second term is negligibly small compared 
to the first if we are to demonstrate the emergence of the microcanonical 
ensemble. The spectrum of the Hamiltonian for a typical macro-system will 
be so complex that the second term will go to for large times. Then we 
have our desired result. 

It is also important to know how fast the second term vanishes and 
for this purpose some estimation of the magnitude of the second term is 
important. 

When the expansion coefficients Cf are identically distributed indepen- 
dent random variables such an estimation can be carried out quite accu- 
rately. In accordance with general ideas expressed in []32], |3l]], such a cir- 
cumstance could arise when the quantum system is classically chaotic. The 
nature of the eigenstates then is such that at high enough energies, all eigen- 
states look more or less the same. But it should be emphasised that our 
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approach does not really require any strong statement like this but only the 
weaker statement of eqn ( |4.3.37 ). 



The assumption about wildly fluctuating phases allows us to estimate 
the magnitude of R l ^ B - The magnitude of any given C™ can be estimated 
to be -7===. Thus R %3 AB is a sum of N a terms each of magnitude j^jf- and 

random phase. Thus we expect it to be 0(jffih) ~ 7^ ■ Also P\ « 

The off diagonal part thus has an extra factor of 7= relative to the diagonal 
part. 

We can similarly estimate the off-diagonal part in ( 4.3.37| ). We have 



(m\o\m)= E c**ctc$ct h e-< E »- E ^o jk 

j,k.A,B,b 

The off diagonal part is when A 7^ B. There are ~ N\ such terms, 

each having magnitude ■w. Using the random phase approximation and 

"a 

l at. at.. Sat - l 



including the sum over j,k,b we get « TfriY^iVjV N a = This is as 

"a vN a 

expected since the averaging over N a fine grained micro-states is expected 
to produce just such a suppression. Indeed, this was the motivation for 
introducing the a variables. This also agrees with v. Neumann's estimates 
on noting that his ^ is our N a . 

The time-dependent term is nevertheless very important for determin- 
ing the thermalization time. Thus we have the result that to a very good 
accuracy (of order — 5-^), the effective density matrix for large times is given 

by 

p = ^E w\ = jr £ lA){Al (43 - 38) 

It should be noted that the emergence of the microcanonical distribu- 
tion is happening for non-trivial systems. This can be seen by examining 



eqns ( 4.3. 16 - 4. 3. 17 ) at t = where there is no trace of the microcanonical 
distribution anywhere. 

In section 6 we will calculate the leading order time dependent correction 
in a simple example that confirms the above expectations. 

One should also ask what happens if we start off the system in an exact 
energy eiegenstate. In that case we get for the expectation value of O 

m)\om) = Y,paOu + E Ri L°a ( 4 - 3 - 39 ) 

i ij 
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Note that if O is a reasonable observable we should find that (i\0 2 \i) — 
(i\0\i) 2 < (i\0\i) 2 . This means that J2jOijOji « 0\. Thus one expects 

« \/iVj if we assume random phases or ^ « TVj if they add in phase. 
If we now take into account this relative magnitudes of Oij versus On, one 
gets that the off diagonal term is suppressed by a factor between y/N a Ni 
and y/ N a Ni . If N a is sufficiently large this could mean that for such systems 
even if the system starts of in an exact energy eigenstate it can be described 
by the microcanonical distribution. 

5 Soft Quanta and the Resonance Picture 

In this section we will attempt to describe the general phenomenon of ther- 
malisation in slightly different terms. Instead of working with the full Hamil- 
tonian of the i and a system and the exact eigenstates, let us consider the 
same system from the point of view of time dependent perturbation theory. 
We think of the i system as an atom with discrete states and the a system 
as a collection of harmonic oscillators describing long wavelength photons. 

As a first step let us treat the electro-magnetic field classically and let 
us think of the atom as a two level system. We are familiar with this 
"NMR" type of situation. When the electro-magnetic RF-pulse is the right 
frequency, we have resonant absorption and emission of photons. As we 
apply the RF pulse we expect the two level system to oscillate between 
the two states at the Bohr frequency and on the average there is equal 
probability to be in either of the two states. This happens even when the 
coupling is very weak. In terms of time independent perturbation theory 
(in the NMR case, we can go to the rotating frame and make the problem 
time independent) the off diagonal terms are very large, close to resonance, 
compared to the energy splitting. Thus the energy eigenstates have equal 
amounts of up and down spin states. 

This physical intuition can be applied to our problem. First of all the 
electro-magnetic field has to be treated quantum mechanically. Thus there 
is spontaneous emission in addition to induced emission and absorption. If 
the strength of the electro- magnetic field is large then this can be neglected. 
In oscillator language, the field oscillators should be excited to a high enough 
quantum number. 

Secondly, and more importantly we have a continuum of oscillators, both 
for emission and absorption. Here again we can apply our intuition from 
atomic physics. The effect of the continuum on the discrete state is to in- 
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troduce an anti-Hermitian term in the effective Hamiltonian that gives the 
discrete states an imaginary "width". This makes the time evolution non- 
unitary and gives the states a lifetime by making the wave function decay. 
Similar mathematical manipulations can be done for the traced density ma- 
trix calculation to show within a perturbative approach that the off diagonal 
terms decay with time and the diagonal terms tend to become equal. The 
decay of the off diagonal terms was explained in Section 3 as being due to 
random phase cancelation. Here we give arguments for seeing this more 
explicitly. Just as in the usual atomic physics calculation, the excited state 
decays by emission of energy in the form of a photon, similarly the off diago- 
nal terms also decay as the photon is emitted and absorbed repeatedly. Just 
as the photon escapes in an apparently irreversible way into the continuum, 
never to return (in the limit of the Poincare recurrence time — ► oo), so do 
the correlations escape irreversibly into the continuum. []. 

This language is also suitable for explaining intuitively why the traced 
matrix has equal diagonal terms. When the number density of soft quanta 
is large, then the probability, Pjj, of transition from i to j is equal to the 
reverse. We expect that the change in the i'th diagonal term in the density 
matrix is proportional to PjPji — PiPij- When P^ = Pji, this will stop 
changing only when Pj = Pj. This does not constitute a proof that the 
system will go toward equilibrium in general. But it allows us to intuitively 
understand a criterion for the emergence of microcanonical distribution. The 
main requirements are thus two: 

i) There should be a continuum of soft excitations with energies contain- 
ing the range of Si to AE in order for resonant absorption and emission to 
take place. In particular if Si > AE there will not be any ergodization. 

ii) The number of these should be large. We expect this number to be 
proportional to j-^ , where is the total energy in soft quanta. Since Uuj < 
AE, we have S 7 >> AE. Also one expects E » E n . Thus E » AE, as 
expected. 

If we have a large number of hard photons as well, i.e. those with fre- 
quency larger than AE, one would have to include them in the i-system. 
Otherwise the microcanonical description would not be a good approxima- 
tion. 

While the discussion in Section 4 is completely general, it is somewhat 

3 In the AdS/CFT correspondence, this continuum of soft photons represents the far 
infrared of the CFT. In the bulk, this would be the region in the interior of the black hole. 
The irreversibility described above is the irreversibility of the black hole horizon. This 
was discussed in [llf 
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abstract. The picture in this section is more intuitive. We can use this 
picture to attempt an answer to the question of when, in a given system, 
one can expect self thermalization. The difficult part presumably, is to 
identify the i and a variables unambiguously. 

6 Two Discrete States Coupled to a Continuum 

The \i) (g> \a) system can be thought of as being composed of an atom coupled 
to the electro-magnetic field. This is denoted schematically in Figure 1 
below. What is shown are the energy levels of the uncoupled system or 
equivalently, the situation where the coupling constant is set to zero. When 
the interaction is turned on we expect the energy levels of the atom to get 
broadened. 



6.1 Relaxation of Density Matrix - Thermalization 

We make the approximation namely that the i system has only two states, 
|0) and |1). Furthermore we will assume that the continuum degrees are 
those of a continuum of harmonic oscillators. Harmonic oscillators are easy 
to analyze and also representative of the real situation where we expect a 
degrees to be soft quanta. The frequencies of the harmonic oscillators are 
assumed to start at zero and form a continuum. In practice we just need 
the discrete spacing to be very small compared to the (inverse) time scales 
of interest. So if uj denotes the frequency of the oscillators, jk- = -*- » T. 




The energy levels of the "i" system. 



The continuum energy levels of the "a" systen 



Figure 2: Discrete states coupled to a continuum 



'a 
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On the other hand T should be large enough that the system has time to 
thermalize. In the previous section we saw that this time scale was set by 
7. Thus 7T > 1 and so 5uj « 7. 7 in turn is fixed by the matrix elements 
(1, 0| V|0, a). Thus here also we require that the off diagonal terms generated 
by the interaction with the a degrees of freedom must not be too small - 
this is expected whenever one has a resonance situation , i.e. Tito = AE. 
To do the calculation we will use the technique of influence functionals 



due to Feynman and Hibbs [37] 



Following [37] we let q be the coordinates of the i system [] and Q that of 
the a system. If the system starts off at time t = in the state ^(qi, Qi,0), 
then the density matrix at time t = T is given by (The subscript i stands 
for 'initial', and / for 'final') 

p(q f ,q , f ,Qf,Q , f ;T)= J... J dq t dq' l dQ i dQ' l Vq{t)Vq'{t)VQ{t)VQ'{t) 

^[S{q)-S(q')]+i[S(Q)-S(Q')]+i[S mt {Q,q)-S mt (Q\q')]^^ q M Q> 0) 

"(6.1.1) 

with boundary conditions: 

g(0) = q h q(T) = q f , Q(0) = Q fj Q(T) = Q f 

q'(0) = q[, q'(T) = q' f , Q'(0) = Q'(T) = Q' f (6.1.2) 

The action S(q) for the i-system will not be specified. However we will 
assume, as stated before that there are only two discrete states |0), |1). We 
will further make the simplifying assumption that (0|<7|0) = (1|9|1) = and 
that (0|g|l> + 0. 

The a-system is a harmonic oscillator. Actually it is an infinite number 
of harmonic oscillators, so in fact there should be an infinite number of 
Q variables. For transparency of presentation we first consider just one of 
them. As they are independent the generalization to arbitrary numbers or 
even a continuum is easy. 

The wave function, ^{qi, Qi, 0), is assumed to factorize as 

*(ft,Qi,O) = 0/(fc)x«(Qi) (6.1.3) 

Here 

\</>i) = a\0) + p\l) (6.1.4) 



4 Though the q-system is actually a two level system, we have adopted a notation 
wherein the (/-variables look continuous. The path integrals over q are to be understood 
as appropriate matrix elements in the two level system. 



40 



with \a\ + \/3\ = 1. Let us first consider the case 

X (Q)~e-^ (6.1.5) 

which is the ground state of a harmonic oscillator of frequency lo. Fur- 
thermore we take Si n t(q,Q) ~ C / dtq(t)Q(t). 

We are interested in the density matrix traced over the Q variables: 

/ dQ f p(q f , q' f , Q f , Qf,T) = p Q (q f , q' f ,T) 

If we imagine doing all the Q, Q' integrals we get an expression of the form 

PQ (q f ,q' f ,T) = J ...J ^ ^^ ig^ ^? (^)P ^? / (^)e ^ [ 5 W- 5 ^)]^(g ^ )0K%)i ? (9,'Z , )• 

(6.1.6) 

Here F(q, q') is the result of doing all the Q integrals and incorporates 
the entire effect of the Q-system, called environment by Feynman and Hibbs, 
on the (/-system. This is the influence functional. For the special case of 
quadratic S(Q) and bilinear Si n t{q,Q) it has been shown in |J7]] that it has 
the form, 

F(q, q) =e~JJ dtdt 'l9(t)-g'(t)]Mt,t')q(t')-a*(t,t')g'(t')] (6.1.7) 

For a harmonic oscillator of frequency uj, in its ground state we have 
a(t, t') = a(t — t') and also 

C 



f dre~ lVT a(r) = a(u) « -s6(v + u) (6.1. 

J iO Z 



This is what Feynman and Hibbs call a cold- environment and for such a 
situations transitions pumping energy into the system are unlikely. 

When we have a continuum of oscillators we simply have to integrate this 
expression ( 6.1.8 ) over u. We thus insert ( 6.1.7 ) into ( |6.1.6 ) and evaluate it 



perturbatively. 

6.1.1 Zeroth Order 



Poo = / dq f dq' f 4>* (q f )cj>o(q'f)pQ(qf,q'f,T) 

Pro = / dq f dq' f 4>\{q f )(t)o{q' f )pQ{qf,q f ,T) (6.1.9) 
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and so on. Thus 

(0|p|0) = (0|e" iHT (a|0) + /9|l))((0|a* + (l\P*)e lHT \0) = |a| 2 (6.1.10) 

(l|p|l) = |/3| 2 (6.1.11) 

(l|p|0) = (l|e-^ T (a|0) + P\l))((0\a* + (1| = (3*)e iHT \0) = a* ^( E i-Eo)T 

(6.1.12) 

(0\p\l) =aP*e +i ^ El - Eo)T (6.1.13) 

6.1.2 First Order 

We now go to the next order. We bring down one power of a(t,t'). 
We write it as the sum of four terms: 

$PQjo(T) = - J J dq f dq' f (f)*(q f )(f)o(qf) J Vq{t)=Vq'(t) J dqi J dq t 

j[S{q)-SV)] [a + b + c + d]<f>i(q i )<j>Kq'i) (6.1.14) 
where the subscript j on <f> can be 0, 1 and 



dt J dt'q(t)q{t')a(t,t') 
b = - f dt I dt'q'(t)q(t')a(t,t') 



dt J dt'q(t)q'(t')a*(t,t') 

d = J dt J dt'q'(t)q(t')a*{t,t') (6.1.15) 

The calculations are done in the appendix. Here we will simply give the 
result. 

We get for the first order correction to poo the following: 



?00 (T) = -T{\a\ 2 [2a R (AE) + 2a R (-AE)] - 2a R (-AE)}\(0\q\l)\ 2 

(6.1.16) 

where AE = E\ — Eq, and 

Let us first assume that AE > 0. Using Q6.1.S| ), which says that a R (AE) = 
0, we find 

6p Q00 (T) = -T[(\a\ 2 - l)2a R (-AE)]\(0\q\l)\ 2 (6.1.17) 
This goes to zero when |a| — > 1. 
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If AE < we get 



Spooo(T) = -T\a\ 2 2a R (AE)\(0\q\l)\ 2 (6.1.18) 

When Ei > Eq one expects the system to decay to its lower energy state 
|0) with the emission of a soft quanta. The decay stops when the probability 
of occupation of the lower energy state is one. Also as ( |6.1.17D shows 5p is 
increasing with time as it should be. Thus one expects 0{T) ~ /3(0)e _7T . 
So \a{T)\ 2 = 1 - \P(T) \ 2 |a(0)| 2 + |/3(0)| 2 2 7 T. Thus 5p « |/?(0)| 2 2 7 T. 
This agrees with Q5.1.17| ) for 7 = a fl (-AE)|(0|g|l>| 2 . 

Similarly when > -£<i> the probability of occupation of Eq decreases 
with time as |a(T)| 2 ?s |a(0)| 2 e _27T and so 6p « — |a(0)| 2 2 7 T as shown by 

Similarly we can look at the off diagonal terms 

5 PQ i = -Ta*Pe- l ^- E ^ T \(0\q\l)\ 2 [a(-AE) + a(Afi)*]+ 

[f T dTe- l ^- E ^ T ]aP*e- i( - El - E ^ T ((l\q\0)) 2 [a(0) + a(0)*] (6.1.19) 
./o 

If AE 1 7^ 0, the integral over r in the second term of ( |6.1.19| ) is vanishingly 
small relative to the first term. We are left with 

5 PQW = -Ta^e-^-^mq^HAEY + a(-AE)} (6.1.20) 

This is also as expected because the product a{T)(3{T) decays as e~ lT (one 
of the terms decays and the other goes to 1). 

While all this is as expected, this does not really demonstrate the De- 
coherence that we are after. In order to demonstrate decoherence we must 
show that the off diagonal terms decay even when the diagonal terms do 
not. In order to obtain a situation where the probabilities of occupation 
of the states |0),|1), are equal, one must have the external oscillators in 
excited states as well so that the system can absorb soft quanta. An easy 
route would be to let the two states be exactly degenerate so that energy 
conservation allows transitions in either direction. But in such a situation it 
is always possible to find orthonormal linear combinations of the two states 
in which the the operator q is diagonal. It is then a simple exercise to show 
that in this basis the diagonal elements of the density matrix do not change 
at all. Even on physical grounds exact degeneracy is unreasonable to as- 
sume. What is more reasonable is to consider a situation in which AE ^ 
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but "small" compared to some relevant energy scale in the problem. Nev- 
ertheless, we need to calculate influence functionals for situations where the 
system can both absorb and give energy to the quantum environment. The 
influence functional calculated in Feynman and Hibbs for the case where all 
the environment oscillators are in the ground state will not suffice. While 
the generic case of the environment oscillators in arbitrary excited states is 
too difficult to handle, the influence function for the case where the environ- 
ment states are coherent states or number states can be explicitly evaluated 
and this suffices for our purpose. 

A partial form of this result is already available in Feynman and Hi- 
bbs (see eqn 8.141). They considered the amplitude f(b,a) that a forced 
harmonic oscillator goes from state / to state g where 

/(x) = (^)i e -<3?><*- ) 2 g(x) = fei e -<3FX*-*> a (6.1.21) 
7rn irn 

Their explicit expression for f(b,a) is 

f(b,a) = Ae [-^r(a 2 +6 2 -2a6e-^)+i(^)i(ar,We— T )] (fU22) 

where 

A = e-^/ 2 e-^nlo So 7(*)7(-)e--(*-) ( g_ L23) 

Of course, the states f(x),g(x) are coherent states of the harmonic oscillator 
but with real parameters. It suffices, for our purposes, to generalise this 
result to the case where f(x) is a coherent state with complex parameter £ 
while leaving b still real. The corresponding results are 

/(x) = ( — — )4 e 2 2 e \*h>\ x V m«, (6.1.24) 
irn 



where we have followed the Pan char atnam convention for phases. 

/(&,£) = A e^^The+b*-2^tb e -^)+i(W)hV^^+bv*e-^)] 

(6.1.25) 

where 

A = e^-^ g— e -^/ Q T /; 7 (*h(.)e-Mi-.) (6.1.26) 

Now it is easily seen that Go£,G m g where Gy-j is the amplitude that the 
oscillator initially in state i is found in state / at time T, are given by 

G 05 = e-^-W" e-^So So TdhMe-**-) (6 L27) 
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and 

G mi = -i= Got (^ + V) m (6-1.28) 



Finally the influence functional is given by 

F[q,q'} = ]T G m? G" m? * (6.1.29) 

m 

The final result can be expressed as 

F ( [q,q'] eWl-rfWW-*"* F [q,tf] (6.1.30) 

where Fq is the influence functional for the case where the environment 
oscillators are in the ground state. 

Thus the environmental coherent states produce an additional influence 
functional whose exponent is a linear functional of q. However, to the degree 
of accuracy needed for our perturbative calculations we can translate this 
into an effective quadratic functional by expanding the additional term to 
second order i.e 



4M 2 to- 

It is sufficient to look at terms of the type 

t-T ft 



[ dt [ dsAa(t,s)q(t)q(s) 
Jo Jo 



to determine the one complex function Aa(t, s) that determines the addi- 
tional effective influence functional. After some algebra it follows that 

r 2 

Ao(t, s) = — V~ [2^e~ iw{t+s) + 2C 2 e iw{t+s) + 2|£| 2 (e^ ( *~ s) + e^*"^] 
AM 2 uj 

(6.1.31) 

Unlike the case of the environment oscillators in the ground state the com- 
plex function is not a function of t — s alone but it turns out that the parts 
of a(t, s) that are functions of t + s do not contribute to the transition rates 
as the integrations over s,t produce mutually exclusive delta functions and 
one is effectively left with 

Afle//(M) = S 2 ^ 2 K| 2 [( eMt_S) +e-^- s) ] (6.1.32) 

Thus when the environmental oscillators are in coherent states we have a 
mixture of a cold- environment whose strength is independent of the coherent 
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state parameter £ and a classical noise environment with or(i/) = an{—v). 
Further an{y ) is directly proportional to the level of excitation of the coher- 
ent states. It should be emphasised that any environment can be represented 
as a mixture of a cold environment and a classical noise environment. But 
what is special about the coherent state case is that the cold component is 
independent of the level of excitation of the coherent states. This will be 
crucial for us later on. 

In the number state case, there are no t + s terms to begin with (as 
shown on the Appendix) and one obtains flBjj ) directly - this is (|6.1.32|) 
with |£| 2 replaced by m, the excitation number of the oscillator. 



If we use ( B.28 ) (or ( |6.1,32 )), with large m or |£| 2 , we can neglect the 



cold term in the influence functional. In this case an(AE) = an(-AE), and 
both are non-zero. Then we see from (|6. 1.20| ) that 



Spqqo = -T(2\a\ 2 - l)|(0| (? |l)| 2 2a R (A J B) 
6p Q0 i = -Ta*f3e- iAET \(0\q\l)\ 2 2a R (AE) 



PQ01 

« p Q oi(e- 7T -l) (6.1.33) 

Thus in this approximation even when \a\ 2 = ^, Spqoi ^ 0. Equation 
( |6.1.33 ), if our extrapolation from the linear term to the full exponential 



is correct, demonstrates decoherence. To actually prove decoherence one 
would have get a more complete solution. Presumably this can be done 
numerically in some cases. 

Thus we have evidence for decoherence in an explicit calculation in this 
simplified model. The crucial point here is the appearance of the imaginary 
term due to "absorption of a soft quanta by the environment" . We have put 
the quotation marks because this is really just a convenient trick. We have 
made a division into two sets of variables i and a with the requirement that 
a should have a continuous spectrum We called these "soft quanta" . Then 
we showed that the same mathematical manipulations that give rise to an 
imaginary part to the energy of a discrete state can be used here to show 
that the off-diagonal terms of the density matrix decay. 

We should also check that the conditions set forth at the end of the last 
section are indeed satisfied. 5 a is zero for this calculation. Due to resonance, 
the off-diagonal terms are large. Thus both conditions are satisfied. While 
we have not proved even in a non-rigourous way, the existence of the phe- 
nomenon of self thermalization, the calculations done here do make plausible 



46 



the physical ideas of section 2 and also buttress the intuitive random phase 
arguments about the off diagonal terms of the density matrix decaying in 
time, that were made in section 3. 



7 Conclusions 

In this paper we have attempted to provide a physical picture of the pro- 
cess of self thermalization, by which a pure state can appear thermal if 
studied with coarse resolution. This phenomenon underlies the validity of 
the microcanonical ensemble in quantum statistical mechanics. It also has 
applications in the black hole information paradox. 

We have presented an intuitive physical picture along with a mathe- 
matical formulation of the process. The principal new ingredient was the 
introduction of dynamical "soft quanta" that are not included in the mi- 
crostate descriptions. They are coarse grained away and in the process, 
remove correlations between the (coarse grained) microstates. Thus they 
not only provide the quantum analogue of the Gibbsian coarse graining of 
microstates (Section 4), they also thermalize the system by resonant ab- 
sorpton and emission (Section 5,6). Mathematically this is identical to the 
appaearance of an imaginary part in the energy of an excited state when it 
couples to a continuum. This also provides a way to calculate the thermal- 
ization rate. A first order perturbation calculation (Section 6) supports this 
picture. 

It would be interesting to apply this picture to some interesting systems 
in a quantitative way. This would be a test of the correctness of these ideas. 
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A Appendix: Density Matrix Calculation 

In this appendix we derive (|6.1.16| ) and ( |6.1.19| ) starting from ( 6.1.14 ). 

We will first work out in detail the contribution of the perturbation 
labelled 'a'. 

The q and q' integrals can be done separately. Let us denote by Aj the 
q integral and by A' (A" does not depend on j) the q' integral. Thus 

Aj = f dq f 4>*{q f )Vqe lS ^ f dq iq (t)q(t')4>i(Qi) 
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A' = j dq'jMQ'fW^e-^dq^Uq'd 

S PQj0 (T) = [ T dt ( T dt'ait^AjA 1 (A.l) 
Jo Jo 



Aj =Jdq f J d qi 4>*(q f ) J Vqe iS Mq(t)q(t')<f>r(qi) 
= E 01e-^ (T -^|m)e-^')(m|,e-^'(a|0)+/3|l)) 

m=0,l 

If j = we get 

A = (0|e- l ^( T -*)g|l) e -^(*-*')(l|ge- lBo *'a|0) 

= e- iBoT e -^ 1 - £ °)(*-*')(0|( ? |l)(l| (? |0)Q (A.2) 

If j = 1 we get 

^ = (l| e -^i( T -*)g|0)e-^ (*- t ')(0|ge- lSlt '/3|l) 

= e- iElT e-^ E °- E;i )( t -*')(l|g|0)(0|g|l)/3 (A.3) 



A' : 



A' = [J d q 'p q \t)d q 'Mq^ Siq 'U*Wf)Y 



= [(0|e-^ T (a|0) + fll))]* = a*e iEoT (A.4) 
Combining all the above we get the contribution due to the perturbation 



a. : 



5 PQ oo(T) = \a\ 2 f dt f dt'e-^-^^ait-t'mqm 2 
Jo Jo 

= T\ a \ 2 \(l\q\0)\ 2 a(E 1 -E ) (A.5) 



and 



5 PQ10 (T) = a*Pe~ i ^~ E ^ T f dt f dt 1 e'^'^-^ a{t - t')l(lk|0)| 2 

Jo Jo 

= Ta*P\(l\ q \0)\ 2 a(E - ^g-^i-^T r A ^ 
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We can similarly calculate the contribution from the other ones. 



B = (0|e- iH(T -*' ) g|m)e-^ m *'(m|(a|0) + 

= (3e- iEoT e- l ^- E ^' {0\q\l) (A.7) 

B 1 = (l|e- iH ( T -*')g|m)e-^'(m|(Q|0) + 0\1)) 

= ae-^ T e-< E °- E ^'(l\q\0) (A.8) 



B' : 



B'=[j dq'^Uq'fWime^Mq'i)]* 



= [{0\e- iH( ?^q'\m)e- iEmt {m\{a\$) + 
Combining the above 



$PQ00 = T|/?| 2 |<0|g|l>| 2 a(£ - *k) (A.10) 

and 

rT 
10 



S PQ10 = [ [ T dre-^-^^e-^-^iiimfaiO) (A.ll) 
Jo 



(Note that the factor inside the square brackets replaces the factor T 
that appeared in previous terms.) 



C = (0|e-^ T -*)g|m)e- iBmi H(a|0) + 

= (3e- iEoT e- i( - El - Ei>)t (0\q\l) (A.12) 

d = (l|e- iH(T -* ) g|m)e- iSm *(m|(a|0) + (3\1)) 

= ae- iElT e-^ E °- E ^{l\q\0) (A.13) 



C : 
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C'=[[ d q ' f ^{ q ' } )V q \t')dq[e^'HM)Y 



= [(0|e-^ T -*Vl™>e-^ m 'Vl(a|0) +/3|1»P 

= P*e +iEoT e- i{Eo ~ El)t '(l\q\0) (A.14) 

Combining: 

= r|/3| 2 |(0|g|l)| 2 [a(£:o-^i)r (A.15) 



5p QW = [/ T (iTe-^- Bl ) r ]«re- 4{£l - £o)T ((lk|0)) 2 a*(0) (A.16) 



o 



D = (0|e- iHT (a|0) + = ae- ii?oT (A.17) 

D 1 = (l\e- iHT (a\0) + = /3e- iElT (A.18) 

D' : 

D'= [J dq' f 4>Uq'f)Vq\t) q \ty q ' t e^'Ul(q' i )r 

= [(0|e- iH(T - t) g / |m)e" iBm( ^*' ) (m|ge- im '(a|0) 

= ^e^e-^^-^^^^KOlgll)! 2 (A.19) 

Combining: 

5 PQm = T\a\ 2 m q \V\ 2 HEi ~ E )]* (A.20) 

6p Q10 = Ta*/?|(l|g|0)| 2 [a( J E 1 - £ )]* e -^- £ °) T (A.21) 
If we add up all the contributions a + b + c + d we get the expressions 



L16|) and (|6.1.1 
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B Appendix: Influence Functional Calculation 



In this Appendix we will derive the influence functional for the cases when 
the "external" oscillators are either in coherent states or in number eigen- 
states. 

We start with the kernel K(Qf,Qi,Cq) for a forced harmonic oscillator 
forced with f(t) = Cq(t) given by 



K = J — e iSd(Qf,Qi,C q ) ( B n 

V 2'Kihsin cuT 



N 

As shown in |57|] S c i is given by, 

71 jT 

S d = 2gin uT [co8 ljT(Q} + Ql) - 2Q f Qi 

2Q f C 2QiC [T . . , m . , 

+ —j — / q(t)sin ut at H — — — / q[t)sin uj(T — t) at 
Muj Jo Mlo Jo 

~ T73 — 9 / / q(t)q(s)sin uj(T — t)sin u>s ds dt] (B.2) 
M z uj z Jo Jo 

We need to calculate the following quantity: 

dQ / dQ i ^|iV| 2 e^ 5ei ( Q /' Qi ' C9 )- s = i ( /'^' C9 'V(Qi)V'*(Qi) (B.3) 

The Q oscillator starts off at time t = in a state ip. The density matrix 
at time T is then traced over the coordinates of the Q oscillator. For ip(Qi) 
we use the coherent states. Thus we use 

/(x) = (— — )4 e z 2 e ( 2n >( x Vm ?) (B.4) 

where we have followed the Pan char atnam convention for phases. It should 
be noted that eqn (B.3) straightaway gives the influence functional. The 
method followed in |37j is needlessly circuitous. After obtaining the non- 
perturbative expression for the coherent case influence functional which will 
be shown to have a part linear in q(t),q'(t), we will introduce an effective 
influence functional quadratic in q(t),q'(t) that is obtained perturbatively. 
We will obtain expressions for this effective influence functional both for the 
coherent states as well as for the number states. 
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The Q-dependent terms in the integrand in ( |B.3| ) can be written as 

e - 1 [AQ2 +BQ 2 i +CQ' 2 l +2DQ 1 Q l +2EQ fQ'^+FQ f+GQi+HQ^ ^ ^ 



with 



A = 0, B = --^-ie i » T , C = B\ (B.6) 

TlSVULaJI 

D = —— , E = D B.7 
fisiiiwi 

F = %C [ T (q - q') sin ojt dt, (B.8) 

G = %C [ T qsin lo(T - t) dt + J ^£ (B.9) 
n sin wT Jo y n 

H = [ T q'sin u(T -t)dt + t/^r (B.10) 

nsmul Jo \j n 

The result of doing the Gaussian integral is 



/(2tt)3 J_ )3 2 _|_ [F2+(G+ ^) 2+e -^ 2FG+e ^ 2FH] 
V 2 v Ma/ v 7 

The overall normalisation factor is 



TV = |iv| 2 J&V( A)a rf wT ( ^)i/2 e? 2 / 2+ r 2 /2-i^ 

V 2 Mw 7m 



= e « 2 /2+r^/2-|€l^ (B .l2) 
Thus the final result is 

Nexp{-?—[F 2 + (G + H) 2 + e~ iujT 2FG + e iujT 2FH] 

c 2 r T f l 

(t)q(s)sin u(T — t)sin ujs ds dt 



h sin ojT Muj 



o J o 

+ / / q (t)q (s)sin u(T — t)sin ojs ds dt} (B.13) 

n sin wT Mw jo Jo 



As shown in [37] the influence functional for this class of problems has 
the general form 



F(q,q') = f T [\q(t)-q'(t)][a(t,s)q(s)-a*(t,s)q'(s)] dt ds (B.14) 
Jo Jo 
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Thus we can extract a(t, s) from the q(t)q(s) terms. Calculationally it is 
easier to look at the q(t)q'(s) terms to get a(t,s) as the terms in S c i that 
are quadratic in q(t) do not contribute. We find for the a independent part 
of the q(t)q(s) term the following sum of four terms(an overall factor of -S— 
has been suppressed for the moment): 

1 



2sin 2 ujT Jo 



j(t)q(s)sin ujtsin ujs 
(t)q(s)sin uj(T — t)sin u(T 



2sin 2 ujT 



2sin 2 ujT Jo Jo 

q(t)q(s)[sin utsin uj{T — s) + sin ujssin uj{T — t)\ 



o Jo 



sin toT 



T r t 

q(t)q(s)sin uj(T — t)sin los (B.15) 



JO 



These can be rewritten as functions of t — s and t + s. The t — s terms 
are(after restoring all factors) 

C 2 1 

; [2cos u(t-s)- e~ lujT cos u(T- s + 1) 



Mlo 4sin 2 cuT 

e~ iujT cos uj(T -t + s) - 2is'mujTcos uj(T -t + s)) (B.16) 



They add up to 

C 2 cosu(t-s) sinujt- s) C 2 to(t _ s) , . 

Mu [ 2 2 1 2Mlo { ' ' 

This is the influence functional when the Q-oscillator is in its ground state 
We also have to check that the t + s terms are zero The t + s terms are 

1 1 

-cos u)(t + s) + - — r-5 — t^cos lo(2T — t — s) 



4sin 2 loT 4sin 2 wT 

ie' iwT . , i 



, , cos u(T - s - t) - — — coslv(T - t - s) (B.18) 
2sin z ojl zsincul 

They can be seen to add up to zero. 

Now we turn to the ^-dependent terms. First let us look at the terms 
quadratic in £: 

-\{^-w+ef\ = ^+e) 2 
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the exponential of which exactly cancels the factor jV. 



(B.19) 

Now we look at the terms linear in £: 



4Mlo 



On using the expression for F and some rearrangement it is easy to see that 
this equals 



iC H 



I* dt (q(t) - q\t)){ie^ T + f e^ T ) (B.20) 
J 



7i V 2Mw 
which can be reexpressed as 

iC£(rj-r)') + iCC(v* ~v'*) (B.21) 

where 

We can thus express the influence functional for the case when the Q- 
oscillator is in the coherent state as 

F^(q, q') = F (q, q') e ^(W)+*cr('<rW*) ( B .23) 

with r\ given by eqn ( |B.22[ ). Within perturbation theory we can expand the 
exponential of the linear functional in q, g'.It turns out that the linear term 
does not contribute and the quadratic terms are: 

1 ie (v-vT+e 2 (v*-^y+m\ 2 \(v-v)\ 2 ] 



4M 2 w 

It is sufficient to look at terms of the type 



[ T dt f dsAa(t,s) coh q(t)q(s) 
Jo Jo 



to determine the one complex function Aa(t, s) coh that determines the ad- 
ditional effective influence functional. After some algebra it follows that 



r 2 

Aa(t,s) coh = -^-p- [2^ e - i -(*+ s ) + 2r 2 e^ ( * +s) + 2|e| 2 (e^ ( *- s) +e- 4Lj(t - s) 

(B.24) 
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Now we turn to the evaluation of the effective influence functional when 
the Q-oscillators are in the m-th number state \m). For this we recall the 
expansion of the coherent state |£) in terms of the number states: 

cn 

IO = e-'^ 2 £ ±=\n) (B.25) 



Thus 

cn t*m 

Fs(q,q')=e-M £ ^=i— F m , n (g, </) (B.26) 
n,m V"! Vml 

Therefore to get the influence functional(in the perturbative sense described 
above) for the case in which the Q-oscillators are in the state \m) one has to 
expand e'^' F? in powers of |£| 2 (and quadratic in q(t) and pick the coefficient 
of |£| 2m .The m-th term of the expansion is 

^T" a (t, s) + fl C 2 H 2 (B.27) 

ml (m — lj! 

It is elementary to show that 

a m (t, s) = a (t, a) + m^-^ 1 ^ + e -M*-)) (B.28) 
MuJ 
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